Search CORE

17 research outputs found

Proteochemometric modeling of HIV protease susceptibility

Author: Eklund Martin
Lapins Maris
Prusis Peteris
Spjuth Ola
Wikberg Jarl ES
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background A major obstacle in treatment of HIV is the ability of the virus to mutate rapidly into drug-resistant variants. A method for predicting the susceptibility of mutated HIV strains to antiviral agents would provide substantial clinical benefit as well as facilitate the development of new candidate drugs. Therefore, we used proteochemometrics to model the susceptibility of HIV to protease inhibitors in current use, utilizing descriptions of the physico-chemical properties of mutated HIV proteases and 3D structural property descriptions for the protease inhibitors. The descriptions were correlated to the susceptibility data of 828 unique HIV protease variants for seven protease inhibitors in current use; the data set comprised 4792 protease-inhibitor combinations. Results The model provided excellent predictability (<it>R</it>2 = 0.92, <it>Q</it>2 = 0.87) and identified general and specific features of drug resistance. The model's predictive ability was verified by external prediction in which the susceptibilities to each one of the seven inhibitors were omitted from the data set, one inhibitor at a time, and the data for the six remaining compounds were used to create new models. This analysis showed that the over all predictive ability for the omitted inhibitors was <it>Q</it>2 <it>inhibitors </it>= 0.72. Conclusion Our results show that a proteochemometric approach can provide generalized susceptibility predictions for new inhibitors. Our proteochemometric model can directly analyze inhibitor-protease interactions and facilitate treatment selection based on viral genotype. The model is available for public use, and is located at HIV Drug Research Centre.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Prediction of indirect interactions in proteins

Author: Lapinsh Maris
Petrovska Ramona
Prusis Peteris
Uhlén Staffan
Wikberg Jarl ES
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Both direct and indirect interactions determine molecular recognition of ligands by proteins. Indirect interactions can be defined as effects on recognition controlled from distant sites in the proteins, e.g. by changes in protein conformation and mobility, whereas direct interactions occur in close proximity of the protein's amino acids and the ligand. Molecular recognition is traditionally studied using three-dimensional methods, but with such techniques it is difficult to predict the effects caused by mutational changes of amino acids located far away from the ligand-binding site. We recently developed an approach, proteochemometrics, to the study of molecular recognition that models the chemical effects involved in the recognition of ligands by proteins using statistical sampling and mathematical modelling. RESULTS: A proteochemometric model was built, based on a statistically designed protein library's (melanocortin receptors') interaction with three peptides and used to predict which amino acids and sequence fragments that are involved in direct and indirect ligand interactions. The model predictions were confirmed by directed mutagenesis. The predicted presumed direct interactions were in good agreement with previous three-dimensional studies of ligand recognition. However, in addition the model could also correctly predict the location of indirect effects on ligand recognition arising from distant sites in the receptors, something that three-dimensional modelling could not afford. CONCLUSION: We demonstrate experimentally that proteochemometric modelling can be used with high accuracy to predict the site of origin of direct and indirect effects on ligand recognitions by proteins

University of Bergen

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

NORA - Norwegian Open Research Archives

Linking the Resource Description Framework to cheminformatics and proteochemometrics

Author: Alvarsson Jonathan
Andersson Annsofie
Eklund Martin
Lampa Samuel
Lapins Maris
Spjuth Ola
Wikberg Jarl ES
Willighagen Egon L
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Semantic web technologies are finding their way into the life sciences. Ontologies and semantic markup have already been used for more than a decade in molecular sciences, but have not found widespread use yet. The semantic web technology Resource Description Framework (RDF) and related methods show to be sufficiently versatile to change that situation. Results The work presented here focuses on linking RDF approaches to existing molecular chemometrics fields, including cheminformatics, QSAR modeling and proteochemometrics. Applications are presented that link RDF technologies to methods from statistics and cheminformatics, including data aggregation, visualization, chemical identification, and property prediction. They demonstrate how this can be done using various existing RDF standards and cheminformatics libraries. For example, we show how IC50 and K<it>i</it> values are modeled for a number of biological targets using data from the ChEMBL database. Conclusions We have shown that existing RDF standards can suitably be integrated into existing molecular chemometrics methods. Platforms that unite these technologies, like Bioclipse, makes this even simpler and more transparent. Being able to create and share workflows that integrate data aggregation and analysis (visual and statistical) is beneficial to interoperability and reproducibility. The current work shows that RDF approaches are sufficiently powerful to support molecular chemometrics workflows.</p

Maastricht University Research Portal

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The C1C2: A framework for simultaneous model selection and assessment

Author: A Golbraikh
A Kontijevskis
AE Hoerl
B Efron
B Efron
C Hansch
D Wolpert
DE Goldberg
DL Selwood
E Amaldi
E Freyhult
EE Ntzani
G Schwarz
H Akaike
H Kubinyi
H Shimodaira
J Cartmell
J Cartmell
J Kuha
J Shao
Jarl ES Wikberg
JES Wikberg
L Wasserman
LJ van't Veer
M Skurichina
M Stone
Martin Eklund
O Nicolotti
O Obrezanova
O Spjuth
Ola Spjuth
P Burman
R Tibshirani
R Todeschini
RE Kass
S Michiels
SJ Cho
SR Johnson
T Hastie
TR Hvidsten
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Bioclipse: an open source workbench for chemo- and bioinformatics

Author: A Siepel
C Gille
C Gille
C Steinbeck
C Steinbeck
Christoph Steinbeck
E Badidi
E Willighagen
Egon L Willighagen
Jarl ES Wikberg
Johannes Wagener
M Munk
Martin Eklund
Ola Spjuth
P Murray-Rust
P Murray-Rust
P Rice
PB Neerincx
Peter Murray-Rust
PT Shannon
R Guha
S Krause
Stefan Kuhn
T Carver
T Oinn
Tobias Helmus
V Curcin
W DeLano
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: There is a need for software applications that provide users with a complete and extensible toolkit for chemo- and bioinformatics accessible from a single workbench. Commercial packages are expensive and closed source, hence they do not allow end users to modify algorithms and add custom functionality. Existing open source projects are more focused on providing a framework for integrating existing, separately installed bioinformatics packages, rather than providing user-friendly interfaces. No open source chemoinformatics workbench has previously been published, and no sucessful attempts have been made to integrate chemo- and bioinformatics into a single framework. RESULTS: Bioclipse is an advanced workbench for resources in chemo- and bioinformatics, such as molecules, proteins, sequences, spectra, and scripts. It provides 2D-editing, 3D-visualization, file format conversion, calculation of chemical properties, and much more; all fully integrated into a user-friendly desktop application. Editing supports standard functions such as cut and paste, drag and drop, and undo/redo. Bioclipse is written in Java and based on the Eclipse Rich Client Platform with a state-of-the-art plugin architecture. This gives Bioclipse an advantage over other systems as it can easily be extended with functionality in any desired direction. CONCLUSION: Bioclipse is a powerful workbench for bio- and chemoinformatics as well as an advanced integration platform. The rich functionality, intuitive user interface, and powerful plugin architecture make Bioclipse the most advanced and user-friendly open source workbench for chemo- and bioinformatics. Bioclipse is released under Eclipse Public License (EPL), an open source license which sets no constraints on external plugin licensing; it is totally open for both open source plugins as well as commercial ones. Bioclipse is freely available at

Maastricht University Research Portal

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Publikationer från Uppsala Universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Kinome-wide interaction modelling using alignment-based and alignment-independent approaches for kinase description and linear and non-linear data analysis techniques

Author: A Golbraikh
A Kamb
A Linusson
A Navia-Vázquez
B Ustün
CC Chang
D Aha
E Freyhult
G Cruciani
G Manning
G Scapin
H Daub
H Drucker
HM Berman
I Dubchak
IH Witten
J Trygg
Jarl ES Wikberg
JD Griffin
JE Wikberg
JE Wikberg
K Illergård
KC Chou
KC Chou
LH Alifrangis
M Bhasin
M Bhasin
M Bhasin
M Lapinsh
M Reczko
M Sandberg
M Van Heel
MA Fabian
MA Larkin
Maris Lapins
MS Cohen
MW Karaman
NP Shah
O Devos
P Bamborough
P Geladi
QB Gao
RJ Quinlan
S Hua
S Madhusudan
S Wold
S Wold
S Wold
SD Peterson
T Lundstedt
TA Carter
V Vapnik
ZR Li
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Protein kinases play crucial roles in cell growth, differentiation, and apoptosis. Abnormal function of protein kinases can lead to many serious diseases, such as cancer. Kinase inhibitors have potential for treatment of these diseases. However, current inhibitors interact with a broad variety of kinases and interfere with multiple vital cellular processes, which causes toxic effects. Bioinformatics approaches that can predict inhibitor-kinase interactions from the chemical properties of the inhibitors and the kinase macromolecules might aid in design of more selective therapeutic agents, that show better efficacy and lower toxicity. Results We applied proteochemometric modelling to correlate the properties of 317 wild-type and mutated kinases and 38 inhibitors (12,046 inhibitor-kinase combinations) to the respective combination's interaction dissociation constant (Kd). We compared six approaches for description of protein kinases and several linear and non-linear correlation methods. The best performing models encoded kinase sequences with amino acid physico-chemical z-scale descriptors and used support vector machines or partial least- squares projections to latent structures for the correlations. Modelling performance was estimated by double cross-validation. The best models showed high predictive ability; the squared correlation coefficient for new kinase-inhibitor pairs ranging P2 = 0.67-0.73; for new kinases it ranged P2kin = 0.65-0.70. Models could also separate interacting from non-interacting inhibitor-kinase pairs with high sensitivity and specificity; the areas under the ROC curves ranging AUC = 0.92-0.93. We also investigated the relationship between the number of protein kinases in the dataset and the modelling results. Using only 10% of all data still a valid model was obtained with P2 = 0.47, P2kin = 0.42 and AUC = 0.83. Conclusions Our results strongly support the applicability of proteochemometrics for kinome-wide interaction modelling. Proteochemometrics might be used to speed-up identification and optimization of protein kinase targeted and multi-targeted inhibitors.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

XMPP for cloud computing in bioinformatics supporting discovery and invocation of asynchronous web services

Author: A Labarga
AR Jones
B Wallner
BioMoby Consortium
C Steinbeck
C Steinbeck
D Smedley
E Jain
E Willighagen
Egon L Willighagen
EW Sayers
GL Holliday
H Stockinger
H Sugawara
Jarl ES Wikberg
Johannes Wagener
L Stein
LM Vaquero
M Hucka
M Lapins
MA Larkin
MD Wilkinson
MWEJ Fiers
N Adams
O Spjuth
Ola Spjuth
P Fisher
P Murray-Rust
PBT Neerincx
R Kottmann
RD Dowell
S Hoon
S Hunter
S Kaarthik
S Kerrien
S Kuhn
S Miyazaki
T Oinn
UniProt Consortium
X Dong
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Background: Life sciences make heavily use of the web for both data provision and analysis. However, the increasing amount of available data and the diversity of analysis tools call for machine accessible interfaces in order to be effective. HTTP-based Web service technologies, like the Simple Object Access Protocol (SOAP) and REpresentational State Transfer (REST) services, are today the most common technologies for this in bioinformatics. However, these methods have severe drawbacks, including lack of discoverability, and the inability for services to send status notifications. Several complementary workarounds have been proposed, but the results are ad-hoc solutions of varying quality that can be difficult to use. Results: We present a novel approach based on the open standard Extensible Messaging and Presence Protocol (XMPP), consisting of an extension (IO Data) to comprise discovery, asynchronous invocation, and definition of data types in the service. That XMPP cloud services are capable of asynchronous communication implies that clients do not have to poll repetitively for status, but the service sends the results back to the client upon completion. Implementations for Bioclipse and Taverna are presented, as are various XMPP cloud services in bio- and cheminformatics. Conclusion: XMPP with its extensions is a powerful protocol for cloud services that demonstrate several advantages over traditional HTTP-based Web services: 1) services are discoverable without the need of an external registry, 2) asynchronous invocation eliminates the need for ad-hoc solutions like polling, and 3) input and output types defined in the service allows for generation of clients on the fly without the need of an external semantics description. The many advantages over existing technologies make XMPP a highly interesting candidate for next generation online services in bioinformatics

Maastricht University Research Portal

Crossref

Springer - Publisher Connector

PubMed Central

Open Access LMU

An eScience-Bayes strategy for analyzing omics data

Author: A Gelman
A Gelman
A Isaksson
BP Carlin
C Desmedt
C Sotiriou
CF Taylor
CN Chi
CP Robert
D Milburn
D Muthas
D Talavera
EC Butcher
EL Kaplan
H Chuang
H Daumé III
HB Mann
HM Berman
Jarl ES Wikberg
JO Berger
JR Chen
L Ein-Dor
L Xu
LD Miller
M Xiao-Li
MA Stiffler
Martin Eklund
N Sha
O Spjuth
Ola Spjuth
P Murray-Rust
P Prusis
PCG da Costa
R Development Core Team
R Edgar
R Tonikian
RG Smock
RL Ho
S Gianni
S Lockless
S Michiels
SR Eddy
U Wickenberg-Bolin
Y Pawitan
Y Wang
Z Kutalik
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge to analyze the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in <it>ad hoc </it>approaches to address specific problems. Results We present a general approach to omics data analysis that alleviates these problems. By combining eScience and Bayesian methods, we retrieve scientific information and data from multiple sources and coherently incorporate them into large models. These models improve the accuracy of predictions and offer new insights into the underlying mechanisms. This "eScience-Bayes" approach is demonstrated in two proof-of-principle applications, one for breast cancer prognosis prediction from transcriptomic data and one for protein-protein interaction studies based on proteomic data. Conclusions Bayesian statistics provide the flexibility to tailor statistical models to the complex data structures in omics biology as well as permitting coherent integration of multiple data sources. However, Bayesian methods are in general computationally demanding and require specification of possibly thousands of prior distributions. eScience can help us overcome these difficulties. The eScience-Bayes thus approach permits us to fully leverage on the advantages of Bayesian methods, resulting in models with improved predictive performance that gives more information about the underlying biological system.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central